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LETTER TO THE EDITOR 



Dear Editor, 

The Lsm (Sm-like) family of proteins, characterized 
by the Sm fold [1] and conserved among eukaryotes, 
plays an important role in RNA biogenesis. The Sm and 
Lsm complexes play an essential role in pre-mRNA 
splicing [2]. Of the five small ribonucleoproteins (sn- 
RNPs), four (Ul, U2, U4, U5) contain the Sm heptamer 
ring, whereas the U6 snRNP contains a specific Lsm2- 
8 heptamer that comprises Lsm2, Lsm3, Lsm4, Lsm5, 
Lsm6, Lsm7, and Lsm8 [3]. Another Lsm heptameric 
complex, Lsml-7, which differs from Lsm2-8 by one 
Lsm protein, is thought to function in mRNA decapping, 
a crucial step in the mRNA degradation pathway [4]. 

In yeast, mRNA degradation begins with deadenyl- 
ation catalyzed mainly by the deadenylase Ccr4-Not 
complex to shorten the polyA tail [5]. Subsequently, 
whether the 3 '-end of mRNA is bound by the Lsml-7 
complexes is an essential determinant for the directional- 
ity of mRNA degradation. In the absence of Lsml-7, the 
exposed 3 '-end of mRNA is captured by exosome and 
degradation begins from the 3 '-end [6]. In the presence 
of Lsml-7, the 3 '-end of mRNA is recognized by the 
Patl -Lsml-7 complex [6]. Then the decapping enzymes 
Dcpl and Dcp2 are recruited to decap mRNA, followed 
by Xrnl -mediated degradation from the 5 '-end [7]. 

The Sm and Lsm proteins exhibit contrasting func- 
tions and distinct biochemical properties. First, the Sm 
proteins are located in the nucleus and only fiinction in 
the spliceosome pathway of mRNA splicing. The Lsm 
proteins appear both in the nucleus and cytoplasm and 
are involved in various pathways of RNA metabolism. 
Second, the Sm proteins only assemble into a heptamer 
ring in the presence of snRNA, whereas the Lsm proteins 
can form two stable heptamer rings by themselves, the 
Lsml-7 and Lsm2-8 complexes. The Sm ring binds the 
central part of snRNA whereas the Lsm2-8 ring recog- 
nizes the 3 '-end of U6 snRNA [8]. 

Unlike the Sm proteins, whose heptameric ring can be 
readily assembled from stable heteromeric subcomplex- 
es, the individually purified Lsm proteins, either alone 



or in heteromeric forms, failed to form a correctly as- 
sembled heptamer in the absence of denaturation/refold- 
ing. Lsm3 is known to form a stable homo-octamer by 
itself [9]; co-expression of Lsm5, Lsm6, and Lsm7 led to 
formation of a stable, but non-fiinctional, hetero-hexamer 
[10]. In fact, every combination of hetero-dimeric or 
hetero-trimeric Lsm subcomplex would result in the for- 
mation of a stable oligomer, which can be reconstituted 
into a heptameric ring only by denaturation and refolding 
[11]. After numerous trials, we succeeded in co-expres- 
sion of all seven Lsm proteins from S. cerevisiae using 
the pQLink plasmid [12]. The Lsml-7 complex was 
biochemically purified to homogeneity (Supplementary 
information. Figure SI). Analysis by mass spectrometry 
confirmed the presence of all seven Lsm proteins. 

Despite rigorous trials, we were unable to crystallize 
the Lsml-7 complex. To solve this problem, we launched 
a systematic protein engineering effort that involved re- 
moval of flexible sequences and mutation of cysteine to 
serine. Eventually we crystallized the Lsml-7 complex 
in the P21 space group (Supplementary information. 
Figure S2). To facilitate structure determination, we also 
generated and crystallized the selenomethionine-labeled 
Lsml-7 complex in the PI space group. The structure of 
Lsml-7 was determined by a combination of selenium- 
based single anomalous dispersion (SAD) and molecular 
replacement using the atomic coordinates of an archael 
Sm-like homo-heptamer (PDB code 1181 [13]). The final 
atomic model was refined at 3.0 A resolution (Supple- 
mentary information. Table SI and Figure S3). 

The overall appearance of the heptameric Lsml- 
7 complex from S. cerevisiae resembles a thick donut, 
with an outer diameter of approximately 70 A, an inner 
diameter of 5 A, and a thickness of 45 A (Figure lA). 
The seven Lsm components sequentially interact with 
each other to form a closed ring, with the order of Lsm 1- 
Lsm2-Lsm3-Lsm6-Lsm5-Lsm7-Lsm4. Except for Lsml, 
each component within the ring only interacts with two 
neighboring Lsm proteins. For example, Lsm7 is in con- 
tact only with Lsm5 and Lsm4. The C-terminal sequence 
of Lsml forms an extended a-helix that crosses over the 
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Figure 1 Structure of the Lsm1-7 heptameric complex from S. cerevisiae. (A) Overall structure of the Lsm1-7 complex in two perpen- 
dicular views. The seven Lsm subunits are color-coded. The central hole of the Lsm1-7 ring is partially blocked by a characteristic 
a-helix from the C-terminus of Lsm1. The crystallized Lsm1-7 complex contains five full-length Lsm proteins (Lsm2/3/5/6/7), residues 
1-93 of Lsm4, and residues 30-173 of Lsml. To facilitate crystallization, Cys45 of Lsm2 and Cys37/Cys63 of Lsm3 were mutated 
to Ser. (B) The Lsml subunit adopts an extended appearance in the Lsml -7 ring. Unlike the other six Lsm subunits, the N- and C- 
terminal sequences of Lsml have well-defined structures that extend from the conserved Sm core fold. (C) Residues from the C- 
terminal sequence of Lsml interact with Lsm3 through hydrogen bonds (H-bonds) and van der Waals contacts. (D) Residues from the 
C-terminal sequence of Lsml interact with Lsm6 through van der Waals contacts. (E) Asp36 of Lsml forms a salt bridge with Lys8 of 
Lsm2. (F) Structural overlay of the Lsml -7 and Lsm2-8 complexes. These two complexes exhibit an RMSD of 0.89 A over 461 aligned 
Ca atoms. Components of the Lsm1-7 ring are color-coded, whereas the Lsm2-8 ring (PDB code 4M78) is shown in grey. (G) Structur- 
al comparison between Lsml and Lsm8. (H) The Lsml -7 ring binds to RNA sequences with micromolar affinities. Lsml -7 only exhibits 
3-fold lower binding affinity for the octa-nucleotide 5'-AAAAAAAA-3' compared to 5'-UUUUUUUU-3'. By contrast, the Lsm2-8 complex 
preferentially binds the oligo-U element but exhibits a similar binding affinity for the oligo-A sequence as Lsml -7. (I) Structural align- 
ment of the conserved Arg residues between Lsm1-7 and Lsm2-8 (PDB code 4M7A). The residues in Lsm1-7 are colored, whereas 
the residues in Lsm2-8 are shown in grey. (J) Biochemical analysis of RNA binding by variants of the Lsm1-7 complex. The values 
in H and J were measured by Biolayer Interferometry (BLI) on a Fortebio Octet system. Each experiment was independently repeated 
three times, with the average value and standard deviation shown here. 



ring, with the ensuing loop associating with Lsm3 and 
Lsm6. The central hole of the Lsm 1-7 ring is partially 
blocked by this a-helix from Lsml, leaving a very small 
opening at the center of the ring. This structural feature 
contrasts that in the Lsm2-8 complex, where the central 
hole has a diameter of 15 A [8]. 

Unlike the other six Lsm components, the N- and C- 
terminal sequences of Lsml have well-defined structures 
that extend out from the conserved Sm core fold (Figure 
IB) and interact with neighboring Lsm subunits in the 
heptameric Lsm 1-7 complex. Notably, a few amino ac- 
ids from the C-terminal sequence of Lsml interact with 
Lsm3 and Lsm6 through hydrogen bonds (H-bonds) and 
van der Waals contacts (Figure IC and ID). In addition, 
Asp36 at the N-terminal loop of Lsml interacts with 
Lys8 of Lsm2 through a salt bridge (Figure IE). These 
specific interactions likely constitute the basis why Lsml, 
but not any other of the Lsm proteins, can replace Lsm8 
to form a heptameric complex with Lsm2/3/4/5/6/7. 

The Lsm 1-7 complex shares six common components 
with the heptameric Lsm2-8 complex, with Lsml in 
the Lsm 1-7 complex substituted by Lsm8 in the Lsm2- 
8 complex. Consequently, the overall structures of these 
two complexes are very similar, with a root-mean-squared 
deviation (RMSD) of 0.89 A over 461 aligned Ca atoms 
(Figure IF). The main difference is between Lsml and 
Lsm8, which can be aligned to each other with an RMSD 
of 0.97 A over 54 aligned Ca atoms in the core Sm fold 
(Figure IG). Notably, these 54 aligned Ca atoms come 
from residues 32-115 of Lsml and residues 1-67 of Lsm8. 

Although the N- and C-terminal sequences of Lsml 
mediate interactions that are unique to the Lsml -7 
complex, the core Sm fold of Lsml appears to make a 
conserved set of interactions compared to Lsm8. Specifi- 
cally, interactions between Lsml and Lsm2, including 
both main chain H-bonds and side chain van der Waals 



contacts, are similar to those between Lsm8 and Lsm2. 
The majority of the Lsml-Lsm4 interactions are also 
preserved in the Lsm8-Lsm4 interface (Supplementary 
information. Figure S4). 

The Lsml -7 complex plays an important role in RNA 
metabolism by facilitating degradation of mRNA. In 
yeast, the Lsml-7 complex binds the 3'-end of mRNA 
and localizes to the P-bodies, ultimately resulting in 
mRNA degradation from the 5 '-end [4]. We examined 
the RNA binding affinities of the Lsml-7 complex using 
biolayer interferometry (BLI) on a Fortebio Octet sys- 
tem. The Lsml-7 complex exhibits binding affinities of 
approximately 6 (xM and 2 |iM for the octa-nucleotides 
5'-AAAAAAAA-3' and 5'-UUUUUUUU-3', respective- 
ly (Figure IH). The binding affinity difference between 
the oligo-A and oligo-U sequences is only about 3 -fold, 
suggesting that the Lsml-7 complex may not recognize 
RNA sequences with the same level of specificity as the 
Lsm2-8 complex [8]. By contrast, the Lsm2-8 complex 
exhibits a binding affinity of about 20 nM for the oligo- 
U sequence, approximately 200-fold tighter than that for 
the oligo-A sequence (Figure IH). These results suggest 
that the RNA binding mode of the Lsml-7 complex may 
be similar to that of the Lsm2-8 complex towards the 
oligo-A, but not the oligo-U, sequence. 

Although Lsml and Lsm8 share limited sequence 
similarity, the RNA-binding residues in Lsm8 are largely 
preserved in Lsml. For example, the D31XXXN35 and 
R57GX motifs in Lsm8 correspond to DyjxXxNy^ and 
RiojGX in Lsml, respectively. In addition, modeling 
studies suggest that the Lsml-7 complex can bind to the 
3 '-end of oligo-U sequence similarly as the Lsm2-8 com- 
plex. However, the oligo-U binding affinity of Lsml-7 is 
approximately 100-fold lower than that of Lsm2-8 (Figure 
IH). Obviously, the sequence variation between Lsml 
and Lsm8 helps determine their RNA binding affinity as 
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well as specificity. The molecular basis for this observa- 
tion is likely to be revealed by the crystal structure of 
Lsml-7 bound to RNA. 

An invariant Arg residue in Lsm6/3/2/8/4 plays an 
important role in RNA recognition by the Lsm2-8 com- 
plex [8] (Figure II). We generated seven variants of the 
Lsml-7 complex, each containing replacement of the 
conserved Arg (Ser in Lsm5) by Ala in one Lsm com- 
ponent, and examined their interactions with the oligo- 
A and oligo-U RNA elements (Figure IJ). As recently 
reported [8], mutation of the conserved Arg to Ala in any 
of the Lsm6/3/2/8/4 subunits results in drastic reduction 
of binding affinity between the 3'-end U-rich sequence 
of U6 snRNA and the Lsm2-8 complex. In the Lsml-7 
complex, such point mutation in Lsm2 or Lsm3 caused 
the most pronounced reduction of RNA binding affinity 
to the oligo-U sequence (Figure IJ). By sharp contrast, 
such point mutation only caused modest changes (less 
than 2-fold) in binding affinities for the oligo-A se- 
quence. This result is consistent with the observation that 
the Lsml-7 complex exhibited a weak specificity for the 
oligo-U sequences (Figure IH). 

In summary, we report the crystal structure of the 
heptameric Lsml-7 complex and preliminary charac- 
terization of its RNA-binding properties. Our structural 
and biochemical characterization serves as a framework 
for mechanistic understanding of the function of Lsml-7 
complex in RNA metabolism. In the final phase of manu- 
script preparation, we noted publication of two related 
manuscripts [14, 15], of which Sharif et al. [14] reported 
the crystal structure of Lsml-7 without RNA binding 
studies. 
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